Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery
نویسندگان
چکیده
We present our approach to unsupervised training of speech recognizers. Our approach iteratively adjusts sound units that are ptimized for the acoustic domain of interest. We thus enable the use of speech recognizers for applications in speech domains here transcriptions do not exist. The resulting recognizer is a state-of-the-art recognizer on the optimized units. Specifically we ropose building HMM-based speech recognizers without transcribed data by formulating the HMM training as an optimization ver both the parameter and transcription sequence space. Audio is then transcribed into these self-organizing units (SOUs). We escribe how SOU training can be easily implemented using existing HMM recognition tools. We tested the effectiveness of SOUs n the task of topic classification on the Switchboard and Fisher corpora. On the Switchboard corpus, the unsupervised HMM-based OU recognizer, initialized with a segmental tokenizer, performed competitively with an HMM-based phoneme recognizer trained ith 1 h of transcribed data, and outperformed the Brno University of Technology (BUT) Hungarian phoneme recognizer (Schwartz t al., 2004). We also report improvements, including the use of context dependent acoustic models and lattice-based features, hat together reduce the topic verification equal error rate from 12% to 7%. In addition to discussing the effectiveness of the SOU pproach, we describe how we analyzed some selected SOU n-grams and found that they were highly correlated with keywords, emonstrating the ability of the SOU technology to discover topic relevant keywords. 2013 Published by Elsevier Ltd.
منابع مشابه
Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
In our previous publication [1], we presented a new approach to HMM training, viz., training without supervision. We used an HMM trained without supervision for transcribing audio into self-organized units (SOUs) for the purpose of topic classification. In this paper we report improvements made to the system, including the use of context dependent acoustic models and lattice based features that...
متن کاملUnsupervised training of an HMM-based speech recognizer for topic classification
HMM-based Speech-To-Text (STT) systems are widely deployed not only for dictation tasks but also as the first processing stage of many automatic speech applications such as spoken topic classification. However, the necessity of transcribed data for training the HMMs precludes its use in domains where transcribed speech is difficult to come by because of the specific domain, channel or language....
متن کاملUnsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units
In our previous work [1, 2], we trained an HMM-based speech recognizer without transcription or any knowledge or resources. The trained HMM recognizer was used to transcribe audio into self-organized units (SOUs) and we evaluated its performance on the task of topic identification. In this paper, we report our work in applying SOUs to discover audio patterns in spoken documents without supervis...
متن کاملFeature Selection with Kohonen Self Organizing Classification Algorithm
In this paper a one-dimension Self Organizing Map algorithm (SOM) to perform feature selection is presented. The algorithm is based on a first classification of the input dataset on a similarity space. From this classification for each class a set of positive and negative features is computed. This set of features is selected as result of the procedure. The procedure is evaluated on an in-house...
متن کاملSelf-organizing maps and its applications in sleep apnea research and molecular genetics
This paper presents the application of special unsupervised neural networks (self-organizing maps) to different domains, as sleep apnea discovery, protein sequences analysis and tumor classification. An enhancement of the original algorithm, as well as the introduction of several hierachical levels enables the discovery of complex structures as present in this type of applications. Furthermore,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computer Speech & Language
دوره 28 شماره
صفحات -
تاریخ انتشار 2014